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Abstract — We study algorithms for carrier and rate allocation 
in cellular systems with distributed components such as a het- 
erogeneous LTE system with macrocells and femtocells. Existing 
work on LTE systems often involves centralized techniques 
or requires significant signaling, and is therefore not always 
applicable in the presence of femtocells. More distributed CSMA- 
based algorithms (carrier-sense multiple access) were developed 
in the context of 802.11 systems and have been proven to be 
utility optimal. However, the proof typically assumes a single 
transmission rate on each carrier. Further, it relies on the CSMA 
collision detection mechanisms to know whether a transmission 
is feasible. 

In this paper we present a framework for LTE scheduling that 
is based on CSMA techniques. In particular we first prove that 
CSMA-based algorithms can be generalized to handle multiple 
transmission rates in a multi-carrier setting while maintaining 
utility optimality. We then show how such an algorithm can be 
implemented in a heterogeneous LTE system where the existing 
Channel Quality Indication (CQI) mechanism is used to decide 
transmission feasibility. 

I. Introduction 

Interference mitigation is a fundamental problem in wireless 
networks. The exact method for handling interference depends 
on the nature of the network, e.g. whether it is a centrally 
controlled cellular network or a more unstructured ad-hoc net- 
work. For cellular networks interference can be mitigated via 
techniques such as power-control, frequency reuse, and fine- 
grained rate control based on channel-quality measurements 
together with some aspect of central planning. On the other 
hand, for ad-hoc networks, especially those running the 802.1 1 
protocol, interference is typically mitigated by a distributed 
collision-based random access scheme, perhaps coupled with 
a fairly coarse-grained rate-adaptation procedure. 

In this paper we are concerned with interference mitiga- 
tion in cellular systems with distributed components such 
as heterogeneous 4G LTE systems that include small cells. 
Small cells are basestations that aim to provide high data rate 
coverage over a small high-traffic area. For example, picocells 
are owned by a cellular provider and placed on public locations 
such as lamp posts. Alternatively, femtocells are owned by an 
end-user with the aim of improving coverage in a private home 
or business. An important property of femtocells is that they 
can operate in Closed Subscriber Group (CSG) mode in which 
the basestation restricts the set of mobile terminals that can 
connect to it. Another effect introduced by femtocells is that 
interference to macrocell users can now come from a femtocell 



in the interior of the macrocell itself, not just from neighboring 
macrocells. 

LTE networks with small cells represent a hybrid of tradi- 
tional cellular networks and traditional ad-hoc networks. On 
the one hand, basestations are running the full LTE protocols 
which allows for the many interference mitigation schemes 
that these protocols provide. On the other hand, the placement 
of picocells and femtocells in an LTE network is likely to be 
unstructured and so the interference configurations are likely 
to resemble a typical ad-hoc configuration. As a consequence, 
there is no hope for any centralized planning, which is a 
possibility for cellular network interference mitigation tech- 
niques such as frequency reuse. We therefore need distributed 
algorithms. 

We are interested in scheduling algorithms for a heteroge- 
neous system that consists of a mixture of macrocells and 
small cells. We wish to determine the channels, or carriers, 
used by each basestation as well as the transmission rate on 
each channel. This should be done in order to maximize a 
utility function associated with the system. Although a number 
of scheduling algorithms have been proposed in the LTE 
context, many of them require a non-trivial amount of signal- 
ing among the transmitters. For example, in some algorithms 
a scheduling decision is preceded by a calculation of how 
the decision would affect the overall system utility, e.g. by 
exchanging partial derivative information between neighboring 
transmitters. This is difficult to support in heterogeneous 
networks with small cells due to the complexity of setting 
up the necessary communication channels. 

Our main result is to show that LTE scheduling in heteroge- 
neous networks can be performed using techniques developed 
in the context of 802.11 networks. These networks utilize 
a Carrier-Sense Multiple Access (CSMA) protocol, often 
enhanced with a Request-to-Send/Clear-to-Send (RTS/CTS) 
mechanism ifTTIl . In this setup, transmitters sense the channel 
before transmitting and proceed only if no conflicting trans- 
missions are active. 

The attractiveness of this framework is that CSMA schedul- 
ing algorithms can achieve optimal throughputs without any 
explicit signaling. Coordination is implicit in the "collision" 
mechanism defined by the CSMA mechanism. In particular, 
Jiang and Walrand showed in [10] that such mechanisms can 
be used to achieve any set of feasible throughputs. Among 
the sequence of papers that followed, Liu et al.[14] presented 



utility-optimal algorithms for CSMA networks with a single 
carrier and single transmission rate. Two subsequent papers 
discussed implementation issues associated with these algo- 
rithms lfl3l and presented an extension to the case of a single 
transmission rate on multiple carriers lfT31 . However, to the 
best of our knowledge no previous work has looked at how 
such techniques can be applied in the LTE context. 

A number of issues arise if we are to use CSMA-based 
scheduling algorithms for LTE. First, LTE networks utilize an 
OFDM physical layer which consists of multiple transmission 
rates over multiple carriers realized by multiple adjustable 
power levels. Hence, in addition to deciding when to transmit 
on each carrier as in [10|, lfl4l . Ifl5l . the scheduler now 
also chooses the power level used and the transmission rate. 
Second, LTE networks do not have an explicit carrier sense 
mechanism to detect conflicting transmissions. We need to 
build this capability via the existing Channel Quality In- 
dication (CQI) mechanism. Lastly, each basestation in an 
LTE network typically has its own local scheduler, such as 
Proportional Fair, that governs the transmissions to the users 
within the cell. We need a mechanism that allows the existing 
local scheduler to work with the CSMA-based algorithm. 

The main result of this paper is a scheduling algorithm 
for utility maximization in heterogeneous LTE networks. Our 
methodology is motivated by the CSMA analysis of |14|. 
We believe our main contribution is in showing that the 
analysis can be adapted for the case of multiple per-carrier 
transmission rates and powers, and (perhaps more importantly) 
describing how the algorithms can be implemented using the 
CQI mechanism present in LTE. 

We begin with a more abstract version of the algorithm in 
which details of the interference are abstracted away into a 
feasibility region for the transmissions. Many of the existing 
CSMA algorithms implicitly work with this notion. We ini- 
tially assume an "oracle" that informs a transmitter whether 
a potential transmission would disrupt existing transmissions. 
The later sections of the paper discuss how to realize such 
a scheme in a heterogeneous LTE system in practice. We 
structure the paper as follows. 

• In Section [II] we present an abstract model of a multi- 
carrier system that allows for multiple transmission rates 
on each carrier. This model assumes that each transmitter 
knows whether a potential transmission is feasible. 

• In Sections IIIpV we adapt the Liu et al. single-carrier 
single-rate utility maximization algorithm for CSMA to 
the abstract framework of Section [II] to address the 
general case of multiple transmission rates on multiple 
carriers. 

• In Sections [V] and VI we give a concrete model for het- 
erogeneous LTE networks with small cells, in which we 
address power level and interference directly instead of 
via the notion of a feasibility region for transmission. We 
also discuss practical issues such as CQI-based collision 
detection and incorporating a local scheduler. 



In Section VII we present simulation results. 



In Section VIII we give an overview of past work on 
scheduling and resource allocation in cellular and 802.11 
networks. 



II. Abstract Model 



We begin by describing an abstract model that captures the 
notion of multiple transmission rates on multiple carriers and 
variable-power scheduling. We consider a system in which a 
set of transmitters communicate to a set of receivers via a set 
of links £ on a set of carriers C at transmission rates from 
a set 1Z of positive numbers. Each link is associated with a 
transmitter and a receiver, where multiple links may share a 
common transmitter but each link corresponds one-to-one with 
a receiver. 

We solve a scheduling problem, i.e. at each time instant 
we specify for every carrier the links that are transmitting 
on that carrier together with the associated transmission rates. 
More precisely, we represent a schedule on a carrier c E C 
by a vector (r , . . . , Tl-i) € TZq where I — \C\ and TZq = 
1ZU {0}. Such a schedule is feasible if it can be realized by 
an appropriate power allocation so that every link £ G C can 
transmit from its transmitter to its receiver at rate re on carrier 
c simultaneously. A scheduling algorithm describes a schedule 
for every carrier c € C at every time instant. Note that a link 
is allowed to simultaneously transmit on multiple carriers. 

This abstract model captures interference and power assign- 
ments by the notion of a feasibility region, which consists 
of all valid schedules. In the next two sections we describe 
our basic algorithmic framework in this abstract model for 
which we do not concern ourselves with how the system 
knows whether a schedule is feasible but simply assume an 
oracle that indicates whether a potential transmission leads 
to feasibility. As mentioned before, for 802.11 networks this 
can be approximately realized by CSMA techniques coupled 
with RTS/CTS messages. In later sections we describe how 
the oracle can be realized in an LTE heterogeneous network. 

We consider the problem of system utility maximization. In 
particular let jg jC indicate the transmission rate on link £ on 
carrier c. For a given concave utility function [/(•) we wish to 
maximize the aggregate utility over all links, i.e. to maximize 
Tit ^(ScT^c)- Note that for each link the utility function 
is applied to the total transmission rate on the link over all 
carriers. This coupling between the carriers implies that we 
cannot simply treat each carrier as an isolated system. 

A formal version of the optimization problem is given 
below. Consider a schedule m € AT C where J\f c is the set 
of feasible schedules on carrier c. If 7r TO £ [0, 1] indicates 
the fraction of time that m occurs and r^ m indicates the 
transmission rate on £ under schedule m, then 7^ c can be 
viewed as a weighted sum of r^ m where ir m serves as the 
weight. Throughout the paper, notations such as 7 indicate a 



Algorithm UL1 RandAcc((£, r, c), A, fi, T) 

• £ : link 

• r : rate 

• c : carrier 

• A : channel access rate 

• fi : expected transmission duration 

• T : time frame 

t <— beginning of T 
while t £ T do 

x <— randomly drawn from Exp(X) 
t^t + x 

if (£, r, c) leads to config. in Af c then 
x <— randomly drawn from Exp(l/fi) 
£ transmits on c at r during (t, t + x] 

t <-t + x 



vector (7£,c)£e£,e£C- We wish to solve: 

max fi (7) = E ^ (^2 ji tC ) 



Algorithm III.2 MMUO 



s.t. 
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< 
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mG7V c 



7T m = 1 Vc. 



We can view this as an extension of the formulation of lfl5ll 
that allows for multirate transmissions (and implicitly variable 
transmission powers). 

In reality the sets C and J\f c can change over time due to 
mobility. However, as in common in the literature we assume 
that this happens on a slow enough timescale that it makes 
sense to solve the utility maximization problem for the current 
network configuration. 

III. Multi-carrier, multi-rate scheduling 

ALGORITHM 

We first describe a routine RandAcc (in Figure Algo- 
rithm III. 1 1, a continuous-time random access algorithm that 
determines when a transmission will take place. We use 
(£, r, c) to denote the transmission on link £ £ C on carrier 
c £ C at rate r £ 1Z. Each transmission (£, r, c) is associated 
with two parameters, A^ r , c the channel access rate and fi£ iTC 
which represents the expected transmission duration. After an 
exponentially distributed waiting period with mean 1/Xi tr<c , 
RandAcc checks whether (£, r, c) leads to a valid schedule 
in Af c at that time instant. If yes, the transmission starts 
immediately and lasts for an exponentially distributed time 
period with mean pe,r,c- Note that an invalid schedule includes 
the situation in which £ conflicts with itself, namely I already 
transmits on c, or I conflicts with another link on c, both of 
which are captured by M c . Note also that since RandAcc 
operates in a continuous manner, two links make a scheduling 
decision simultaneously with zero probability and therefore 
they make conflicting decisions with zero probability. 



for each time frame / do 

for each link I, carrier c and rate r do 

RANDACC((^, r,c), \t,r,c[f]> MrAfl /) 
for each I do 

update service received Sg[f] 

update virtual queue qi[f + 1] according to (j2j) 
for each £, c and r do 

update A^ r , c [/ + 1] and M£,r,c[/+1] according to ^ 



The algorithm MMUO (multi-carrier multi-rate utility op- 
timization) approximates a solution to ([T]i as follows (see 
Algorithm |III.2[ ). Time is divided into frames of fixed duration. 
During each frame /, each potential transmission (£, r, c) calls 
the RandAcc routine with parameters A^ rx [/] and fJ,£, r ,c[f]- 

At the end of each frame, link £ calculates the service 
received during the frame (denoted by Self]), and updates 
a virtual queue size parameter (denoted by qi[f\) as follows. 



(2) 



In the above equation, b is a step size function that satisfies 
property (Al) (defined later), q max and q m in are bounds on the 
virtual queue size, and = min(q max , max(g min , x)). 

The positive parameter V controls the accuracy of the algo- 
rithm. 

The values of Xi >r ,c[f] an d Hi,r,c[f} stay unchanged during 
each frame /, and are updated to A^ rc [/+1] and £*f r,c[/ + l] 
at the end of frame /, so that 



Xl,r,c[f + !] ' + !] = ex P( r ' 1t\I + !])• 



(3) 



As we shall see in Q the performance of RandAcc depends 
on the product of A and //. The choice of this product is 
explained in the proof of Theorem [4] 

IV. Analysis 

We now show that the MMUO algorithm leads to an optimal 
solution to ([TJ. The proof builds upon the following optimality 
properties of RandAcc that were shown in ifTOl for the 
special case of one carrier and 0/1 transmission rates. 

A. RandAcc for 0/1 Rates 

Suppose each link £ £ C calls RandAcc with parameters 
(Xe,Hf,) over a sufficiently long time frame. Let A = {\g)i^c 
and let fl = (fie)e e c- Let m x,fl {t) be the schedule at time t 
under the RandAcc routine. Recall r^ m is the transmission 
rate on link £ under schedule m. In this case r^ m £ {0, 1}. 

Lemma 1 ( iTPZll ). The sequence of schedules m A '' i (t)/or t > 
form a continuous time reversible Markov Chain with the 
following stationary distribution. 



r A c ,/x c 



LL 



■re,, 



% =i^e ■ [it 



J2rn' n&r^ m /=lA| ■ Hi 



Vra, 



where ILj e0 (-) = 1. 



E, 



Further, the resulting link throughput r y £ 



A./7 



is optimal for every link I G £ in the 



following sense. We say that a link throughput vector 7 is 
feasible if there exist 7r m such that 7^ < J2 m eAf r i,m 7T m- 

Lemma 2 ( 11101 ). For any feasible link throughput vector 7, 
there exists A and jl such that 



It 



B. MMUO 

For multiple transmission rates on multiple carriers, the gen- 
eralization from Lemma [TJ to Corollary [3] is straight-forward. 
This is because RandAcc that runs on one carrier does not 
interfere with that on a different carrier. Further, each rate-link 
pair can be treated as a distinct link. For a particular carrier 
c, let vectors A c = (\i, r ,c)lec,reli and fl c = (pte,r,c)eec,reTZ- 
Recall M c is the set of feasible schedules on carrier c. 

Corollary 3. The schedule sequence m Ac,Alc (i) for t > is a 
continuous time reversible Markov Chain with the following 
stationary distribution. 

IL ■■re m >0 Xt 



Vm e M c (4) 



The above expression shows that the stationary distribution 
only depends on the product of the parameters A and p,. 
Whenever MMUO invokes RandAcc, it does so with A and 
H parameters that are set according to Q. For any vector of 
virtual queues q = (qg)iec^ we denote by ir q the resulting 
distribution on Af c from the RandAcc routine. From Q we 
have, 



exp 



re, m > 



T t,m ■ < 



m / 

, >0 r t,m' ' Qi 

The resulting link throughput is therefore, 



2^ ^ m r t,m W,C 



Vm g A/" c (5) 



(6) 



For utility optimization, our goal is to show that the virtual 
queues under MMUO converge to a vector q* so that the above 
link throughput under q* maximizes the utility as defined 
in (|TJ. Note that for this problem we cannot treat each carrier 
in isolation since the throughput of a link is aggregated 
over all carriers. Note also that the optimization problem 
(|7]i in Theorem |4] differs from that of ([T]i in its objective 
function, but shares the same constraints. The motivation of 
this reformulation is to obtain a more useful set of KKT 
conditions. Further, Theorem |4] will state that the optimal 
values of the two objective functions can be arbitrarily close. 
The main result of this section is captured in the following 
theorem. It relies on two standard but technical assumptions 
(Al) and (A2) which we detail below. 

Theorem 4. Under assumptions (Al) and (A2),for any initial 
condition q[0], MMUO converges in the following sense. 

lim q[f] = <f 

f*-oa 



where q* and 7* are such that (7* , ■n q ) is the solution to the 
following convex optimization problem over 7 and ff. 

max / 2 (7,ff) = 

^E^E^-E Z) vr ro log7r m (7) 

leC c c m£j\f c 

S-t. lt.c< E^ r l,mK m V£,C 

7T m = 1 Vc. 

m£Af c 

Further, if 7^ is the optimal solution to |7|), then 

l/i(7*) -/i(7 f )l < |C|log|U c AA c |/U (8) 



The assumptions of Theorem |4] are: 

(^41) E7=o b[f] = 00 and J2%o b 2 [.f] < 00. 
(A2) lfp° 



£ is a solution to 



E 



E 



= g C 



then g min < < 9max for all I G £. 
The parameters &[•] will only be used in the analysis, not in 
the algorithm itself. In addition, the parameters q m [ n and g max 
are under our control. Hence for any problem instance we can 
make sure that Assumptions (Al) and (A2) hold. 

Proof: There are several steps in the proof. In the first 
two steps we follow the framework of [14] to show that 
the dynamics of MMUO can be captured by a system of 
differential equations. In the third step we must deviate from 
the approach of [14] in order to handle multiple transmission 
rates on multiple carriers. 

We begin by replacing the discrete time frames of MMUO 
with a more convenient continuous interpolation. For notation 
we use square brackets [•] indexed with integers / for discrete 
sequences defined on frames /, and we use round brackets (•) 
indexed with real numbers t for a continuous scaled version of 
time. For the discrete time sequence of virtual queue vectors 
q[f] for integral frames / = 1,2, ... , we define as follows 
its continuous interpolation q(t) for all real positive numbers 
t. We also define a continuous version Si(t) for the discrete 
service sequence Si[f]. For / = 1, 2, . . . , let tf = Ei=i ^W- 
For t G [t f ,t f+1 ), let 



s e (t) 



St\f\ 



(In other words the continuous time process t is created from 
the discrete time frames scaled by the intervals [tf, tf+i).) 

The following lemma is shown in [14| and says that the 
continuous sequence q(t) converges to the solution of the 
system of stochastic differential equations |9]) which can be 
viewed as a continuous version of The equations are 
stochastic due to the term Si(t) which is the result of the 
stochastic process of MMUO. 



Lemma 5 ( 1141 ). Let p* be the solution to the following system 
of differential equations with variable p = (p)eec- 



Hence the KKT conditions for |7]) are: 



VI, c 



(11) 



Pi 



where X r 



U'- l { Pl /V)-S t {t) 



1 



pi e 1<j,t 



(9) 



- 1 - log 7r m + 2J r^ m v^c -r] c = Q Vm € A/" c , V£12) 



in f/ze range of [q 
<f(r), f/ien lim T 



w an indicator variable for whether p? is 
mm> 9max]- Fix any time instant r. If p*(r) — 

SU PtG[r,r+T] 



11, c 



< 



\\p*(t)-q(t)\\ = o- 



E 



W,c 



The second step of the proof says that any fixed point 
of the stochastic system |9]l will also be a fixed point of 
an associated deterministic system of equations. Note that 
the RandAcc routine may not converge to the stationary 
distribution of Q, or equivalently (|5j, with the given A and 
/i parameters within each frame before the parameters are 
updated to their respective new values for the next frame. 
Hence the service Se(t) in the above differential equations is a 
stochastic quantity. In other words, when MMUO invokes the 
RandAcc routine, if each frame / is sufficiently long, Sg(t) 
would converge to its long-term average J2 m eM a r A«i' 7r l 
and we could replace the stochastic term Se{t) in (|9| by its 
long term average. This would result in the following system 
(|T0||. Unfortunately, in reality, the frame / may not be long 
enough. However, the following result still allows us to say 
that systems (|9]l and ( [TO} are closely related, regardless of the 
convergence of RandAcc. As we shall see, ( fTO] ) also provides 
a connection to solving (|7]l via the KKT conditions. 

Lemma 6 ( 1141 ). Every limit point of system (|9| is almost 
always a fixed point of the following system. 



meJ\f c 



re, m TT m ) = W, c 



E 7r - 



ve, c > 
-1 = 



W,c 
Vc 



(13) 
(14) 

(15) 
(16) 



Inequalities ( fll) and ( 12 1 state the gradient of the Lagrangian 
£(7, 7? : v : ff) is zero with respect to the variables 7 and 
tt. Inequalities \Yi) , (14i and (15 1 state that the first set of 
constraints of fjj hold and has zero duality gap. The last 
equality ( [To} states the second set of constraints of (|7]i hold. 

We introduce a new variable p — {p)iec an d set me primary 
variables via 



T^m = 7if n as in 
and the dual variables via 



(5) 



Vm 



PC 



log I ex P(E r *,™Qt) ) 



Vc. 



where is defined in ([5]). 



■ 1 



Pe e [q n 



We can see that the KKT conditions ( |T2j >, (jT5l and ( fTS} are 
easily satisfied due to the definition of TT P n infpll and as long 
as p G 3?i. 



We next aim to satisfy the remaining KKT conditions ( 1 1 



W G C, (10) 



( 13 1 and ( 14 1 via the subgradient method. From (Hi, ( 14 1 and 
the fact that vi, c is set to pi, we wish to have, 



In the third step of the proof we show that the system 
( [T0| > leads to a solution of the optimization problem |7]). (It 
effectively solves the problem via the subgradient method). 
For this step we must deviate from the analysis of lfT4l in 
order to handle the multi-carrier and multi-rate aspects of Q. 
In particular, the Lagrangian of (|7]i is given by: 

= Er • C/ (E^)-E^^) 

£ec \ c c / 

+ E E^> c E r ^'» ?r ™ ~ E 7r TO 1 °g 7r "i) 

- E^ c E • 



x (t/'-^p./y) - Y E = °- 



w 



If this does not hold (and hence we are not yet at optimality), 
the standard subgradient method updates the pe according to 
the following system of differential equations. 



pt = U'-\p t /V) 



EE 



p 



(17) 



Due to the convexity of the problem |7]i the system ( [T7| will 
eventually converge to a fixed point, p*. Note that the system 
(17 1 is identical to (10 1 as long as pe G [q m i n , q m ax] f° r a H 
I G C. Due to assumption (A2) and the definition of vt tC , 
the solution to the dual of (|7]i without the constraint pg G 
[?mi n ,?max] falls in the range of [q min ,q mscx ] and is therefore 
equivalent to the fixed point p' 

With p* chosen, KKT condition (JTT » 
set 



of the system ( 17 1. 

is satisfied. We now 



le, 



= V ^ 

m£Af c 



re. 



which satisfy the remaining conditions of (13i and (14 1. 
Finally, since 

fi(f) < /i(7 f ) 

and the entropy X) m 7r m log 7r m < log | U c Af c \, the proof of 
Theorem [4] is complete. ■ 
We conclude by briefly summarizing in what sense we 
have shown that MMUO is optimal. We have shown that 
an appropriate continuous interpolation of the virtual queue 
dynamics approaches in the limit a vector p* that defines an 
optimal dual solution of |7]i (via the KKT conditions). Via 
Q the optimal virtual queue sizes determine channel access 
parameters \g r c and [i£ rc for which the corresponding link 
throughputs provide an optimal primal solution to problem fT). 

V. A More Concrete Model: Heterogeneous LTE 
System 

In this section we present a more concrete model for the 
scheduling problem so that it more closely matches resource 
allocation in LTE heterogeneous networks. 

We begin with a brief system description. We consider 
downlink transmissions from a set of basestations to a set of 
mobile users in a time-slotted system. We assume an OFDM- 
based air interface in which the spectrum is divided into a set 
of carriers called resource blocks (RBs), each of which can 
be scheduled separately. For example a 20MHz LTE system 
is typically divided into 100 resource blocks. In the time 
dimension a time slot corresponds to a Transmission Time 
Interval (TTI) which has a typical duration of 1ms in an LTE 
system. 

We consider a heterogeneous network in which the bases- 
tations are divided into two classes, namely macrocells and 
femtocells. (For ease of description we use the terms "macros" 
and "femtos". However, our discussions also apply directly 
to networks with picocells.) Macros typically have a much 
higher max transmit power than femtos, since macrocells 
provide wide-area coverage, whereas femtocells (which may 
be privately owned) provide focused coverage in one specific 
location, e.g. a house or apartment. At any time instant 
each mobile user associates with one basestation. Each macro 
accepts an association with any mobile user. A femto however 
may be in "Closed Subscriber Group" mode (CSG) and only 
accept an association with a small subset of users. We remark 
that femtos have two notable effects that are departures from 
traditional cellular networks. First, they may create strong 
interference to a macrocell from within the cell itself, whereas 
in a macro-only network interference to a cell mostly comes 
from outside that cell. Second, a mobile user may not be able 
to associate with the basestation with the strongest signal if the 
basestation is a femto in CSG mode and cannot associate with 
the user. We assume that each mobile user associates with the 
basestation for which the received signal is strongest, among 
those that the user is able to associate with. 

Unlike in the abstract model, we address basestation power 
allocation and interference directly instead of via the notion 



of feasibility regions. For basestation i, let [/,; be the set of 
associated users. To abuse notation, we also use [7, to denote 
the set of links that are incident to i as there is a one-to-one 
correspondence between the users and links. 

The maximum transmit power pi for basestation i is given 
and fixed. The scheduling problem is how to distribute pi 
among the resource blocks c £ C and among the users 
in Ui. Let Pi, c (t) be the power allocation of pi on re- 
source block c at time t; let Pi. c j(t) be the allocation 
of p itC (t) on user j G f/j. Note J^ c Pi, c {t) < Pi, and 

{ Pi ' cA % = P n iAt) l 0I TJ e Ul ■ That is, Ptc is 

allocated entirely to one chosen user j € Ui. 

Power settings and transmission rates are related through the 
channel quality information (CQI). CQI values are defined on 
pairs of links and resource blocks. During every time slot t, 
the values of CQI C e (t) for all c g C and £ £ U are reported 
to basestation i. We assume that each basestation has perfect 
CQI reporting. 

Let r Ct i{t) be the transmission rate along link £ on carrier 
c during time slot t. Specifically, for link £ = ij between the 
basestation i and the associated user j, we define 



r c,l{t) 

CQIc^t) 



9ijc(t) 



(18) 
(19) 



In ( 19 1, gij c represents the path loss between i and j on 



resource block c, and N c is the background noise on c. Both 
gij C and N c depend on c since radio propagation conditions 
and background interference may be different on different 
frequencies. The product of Pi lC ,j an d CQI C e is commonly 
referred to as signal-to-interference-plus-noise ratio, SINR. 
(We can therefore think of CQI C £ as the SINR for a unit 



power transmission.) In ( 18 I, w c is the bandwidth of resource 
block c and F(-) represents spectral efficiency as a function 
of SINR. For example F(-) could be a suitably discretized 
version of the Shannon function log(l + x). We assume that 
F(-) is such that r c j is always a member of a discrete set 
ftU{0}. 

The primary scheduling decision is to determine the power 
levels p i c . In the literature this problem is sometimes known 
as inter-cell interference coordination (ICIC). The secondary 
scheduling decision is to allocate pi :C to the user-level power 
Pi,c,j- Typically, each basestation in an LTE network has its 
own local scheduler for user-level allocation, in which case the 
scheduling freedom is at the inter-cell level. For concreteness 
we assume the local scheduler uses the Proportional Fair (PF) 
algorithm. In the following section we give details on how 
to allocate p-^ c and Pi, c ,j- For user-level allocation Pi, c ,j, we 
consider two cases depending on whether a local scheduler 
exists. 

VI. Implementation 



In Section III we described the utility-optimal MMUO 
algorithm for the abstract model. In this section we present 



MMUO-based heuristics for the LTE resource allocation prob- 
lem in heterogeneous networks. We address a number of 
issues. First, scheduling decisions need to be made in slotted 
time rather than in continuous time as in MMUO. Second, 
scheduling decisions are about setting power levels rather than 
transmission rates as in MMUO. Third, we discuss how to 
incorporate a local scheduler such as Proportional Fair. Fourth, 
perhaps most significantly, we show CQI-based methods for 
feasibility detection. This replaces the feasibility oracle and 
the CSMA collision detection mechanism. Lastly, since the 
basestations are divided into two classes, macro and femto, 
interference can be reduced by not having every basestation 
compete on every resource block. 

We begin with a basic heuristic that bypasses the last 
two issues. We then describe three methods through which 
feasibility can be detected in practice. We conclude with a 
modified heuristic in which macros and femtos have priority 
on different sets of resource blocks. 

A. Basic Heuristic 

Our basic heuristic works very much in the spirit of MMUO. 
To address the first issue regarding slotted time each frame 
now consists of an integral number of time slots. When the 
subroutine RandAcc is called with parameters A and /i, the 
time between transmission attempts (resp. the transmission 
period) is drawn from a geometric distribution with mean 
1/ A (resp. mean fi). One problem is that two links may make 
decisions during the same time slot. We can set the 1/A values 
large enough so that this rarely happens. If this does happen we 
assume that both conflicting transmissions cease. A detailed 
explanation of how rare collisions affect the performance of 
utility-optimal CSMA was given in lfT4l and we can apply a 
similar analysis to MMUO. 

The output of MMUO, as described above, specifies the 
transmission rate r c ^(t) for every transmission (I, r, c) that 
takes place during time slot t. To obtain power settings, 



equation ( 18 1 provides the direct translation from transmission 



rates to power levels. 



(*) 



CQI^(t) 



for link l = ij. (20) 



For a given basestation i and resource block c, the feasibility 
oracle guarantees that one user j g {/, has positive power 
allocation Pi, c ,j(t). Let Pi lC (t) = Pi, c ,j(t) for this user j. The 
transmission now takes place as long as J^ c f>i )C (i) < Pi- This 
addresses the second issue. 

If it is the case that we can specify both user-level as 
well as inter-cell power allocations, we are done. However, 
as discussed in Section [V] in many instances we only have 
the freedom for specifying pi >c since the user-level power is 
determined by a local scheduler such as the Proportional Fair 
(PF) algorithm. In this case we run the MMUO algorithm "in 
the background" to compute the p ic values and then determine 
which user in Ui receives the transmission power p^ c using 
the PF algorithm. (We remark that once the power levels 



are set then which user is chosen by PF does not affect the 
interference experienced in other cells.) 

The PF algorithm works as follows. For each link £ e U. L 
basestation i maintains an estimate Re of the recent average 
transmit rate on link I, and allocates power pi tC exclusively to 
the link I that maximizes the ratio r Cf i{t)/R.£, where f c e{t) 
is the nominal rate if user j has power allocation Pi. c .j(t) = 
Pi,c(t)- Again from (18i, we have 



fed*) 



F ( ftjCii (t) • CQI Ci< (t)) for link £ = ij (21) 



After each scheduling decision Re is updated for each link 
according to an exponential filter. This addresses the third 
issue. 

B. Methods for Implementing the Feasibility Oracle 

We now examine options for the only part of the algo- 
rithm that requires coordination among basestations, namely 
feasibility detection. Since resource block power assignment is 
typically done on a slower timescale than individual time slots, 
we are interested in determining whether a set of transmissions 
will be feasible over multiple timeslots. In particular, we do 
not want to declare a transmission feasible if this is only true 
for a single timeslot due to fast fading. 

We discuss multiple mechanisms which all use techniques 
that have been proposed in the standardization process for 
heterogeneous networks (e.g. [1|). Our initial mechanisms use 
the existing CQI channel with one extra piece of information 
which we call the activity indicator. We also allow for a 
basestation to "overhear" a link to which it is not associated. 
Our later mechanisms show how the algorithm could be 
implemented if we indeed have a channel for exchanging 
information between basestations (such as the X2 channel that 
is defined in LTE). The bit-rate of such channels is typically 
limited and so we stress that all we need to exchange are short 
messages such as the activity values. No detailed exchange of 
channel state is required. 

Method 1: In this method, an activity indicator is reported 
along with the CQI. Specifically, let ye, r ,c{t) be the binary 
activity indicator that is set to one if and only if MMUO makes 
a transmission (£,r,c) during time slot t. When CQI Cj £(i) is 
reported to basestation i for which I 6 Ui, ye, r ,c(t) is also 
reported if it is set to 1. Each basestation i listens to all 
CQI that it can decode, not just the CQI for links in f/j. If 
i hears ye>, r '.c = 1 for some £' on resource block c, then 
every potential transmission (£, r, c), for £ £ Ui and r e 1Z, is 
declared infeasible. Note that this method is similar in spirit 
to the Clear-to-Send (CTS) mechanism for 802.11. 

Method 2: This method is less stringent than Method 1 in 
declaring infeasibility. For each activity indicator yi rc {t) = 1 
we define the safety margin to be the ratio between the 
currently achievable transmission rate r C: e{t) and the actual 
rate r c ^(t) that is used by MMUO. This achievable rate 
can be computed from the CQI C) ^(t) values together with 
the current power levels. We assume that the safety margins 
are transmitted on the CQI channel along with the activity 
indicators. For some threshold v > 1 we say that the activity 



indicator is safe if the safety margin is above v, vulnerable if 
the margin is between 1 and v, and in outage if the margin is 
below 1. (Note that if we are in outage then user I could not 
receive data at rate r for the current CQI values.) Method 2 is 
the same as method 1 except that basestation i does not refrain 
from declaring a potential transmission on I G Ui on block c 
feasible, even if it overhears an activity indicator ye,r',c(t) = 1 
as long as this indicator is currently safe. 

The exact value of v could be a network-wide config- 
ured parameter. Alternatively each basestation could gradually 
lower a local estimate of v until it observes links going into 
outage. 

Method 3: This method applies probing to feasibility 
detection. Whenever a basestation i needs to decide if MMUO 
could transmit on (£,r,c), for I e Ui, it briefly sets power 
level pi, c on resource block c and observes the effects on 
other users. Here p ic is the power necessary to carry out 
the transmission (£,r,c) and can be calculated as in (20i. 
If basestation i overhears that any activity indicator moves 
into outage then it sets pi tC back to and declares (£, r, c) 
infeasible. This method has the drawback that it could send 
neighboring users into outage for short periods (and this would 
need to rectified by more robust channel coding on the data 
channels). However, it has the advantage that basestation i gets 
a much better sense of the "damage" that might be caused by 
setting a particular power level pi tC on resource block c. 

Methods 4-6: The next three methods are essentially 
the same as Methods 1-3. However, instead of basestations 
overhearing activity indicators and their associated safety 
status, each basestation would directly communicate their own 
activity indicators and safety margins to all their neighboring 
basestations. This can be done using a channel such as the 
X2 channel in LTE that provides communication between 
neighboring basestations. Note that this is a lightweight com- 
munication since the activity indicator only has 2 possible 
values and the safety status has only 3 possible values. 
In particular the basestations would not be exchanging any 
detailed channel state information. 

Relationship to current LTE proposals: We now briefly 
discuss how the above methods could fit with mechanisms 
that have been proposed in LTE standards for interference 
coordination. In the document [llj on RF requirements for 
femtos, three options are proposed for communication between 
macros and femtos. The first is direct over-the-air communica- 
tion. The second is over-the-air via "victim" users. This would 
correspond to the overhearing methods 1-3 proposed above 
in that the victim user broadcasts channel quality information 
that indicates to an interferer whether it is safe to transmit. The 
third option is via an existing backhaul which corresponds to 
methods 4-6 above. 

Enhanced heuristic: Resource block prioritization:: Note 
that throughout this section we have assumed that attempts 
by a link to access resource block c are governed by Xg. ryC 
and [li r e and are performed independently across resource 
blocks. However, since in heterogeneous networks we have 
two classes of basestations (macros and femtos) there is 
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Fig. 1. Network configuration: 1 macro and 2 femtos each with 2 users. 



potential to reduce interference if each class has priority on a 
different set of resource blocks. We now describe a heuristic 
to achieve this. In particular if basestation i is a macro we 
bias it towards low numbered resource blocks by only letting 
one of its users be active on a resource block if it also has 
active users on all lower numbered resource blocks. More 
formally, if £ G (7, then {£, r, c) is feasible if for all d < c 
there exists £' G Ui and r' G 1Z such that yt' <r ',c' = !■ 
Similarly, if basestation i is a femto we bias it towards high 
numbered resource blocks by only letting one of its users be 
active on a resource block if it also has active users on all 
higher numbered resource blocks. More formally, if I E Ui 
then (£, r, c) is feasible if for all d > c there exists £' G Ui 
and r' G 1Z such that yi> , r ' ,d = 1- 

VII. Simulation Results 

We now provide an example to show how the algorithms 
work. We consider a simple toy example since it allows us 
to compute the optimal schedule. We consider three omnidi- 
rectional basestations, 1 macro and 2 femtos, together with 
six users, two for each basestation. The two macro users are 
at distance 100m (user 0) and 780m (user 1) respectively. 
Each femto has two users (users 2-5) at distance 10m. The 
exact configuration together with the user numbering is shown 
in Figure [T] We assume that user 1 has to be associated 
with the macro since the nearby femto is in CSG mode. The 
transmit power of the macro is 46dBm and for the femtos it is 
8dBm. The system bandwidth is 5MHz and the noise density 
is -165dBm/Hz. We split the system bandwidth into three 
resource blocks. The pathloss is represented by a COST-231 
Hata model. In particular the path loss at distance d meters is 
assumed to be 0.525 *<i~ 3523 . For simplicity we consider two 
instantaneous transmission rates, a "low" rate of 8bits/sec/Hz 
and a "high" rate of 16bits/sec/Hz. We consider blocks. 

For this configuration we aim to achieve user through- 
puts that solve problem We can compute the follow- 
ing optimal solution offline via a standard subgradient al- 
gorithm. For brevity we use notation of the form 0h2£A£ 
to represent a schedule, which means user is receiving 
data at the high rate and users 2 and 4 are receiving 
data at the low rate. In the optimal solution for schedules 
to G {0h2£4£,0h3£U,0h2£5£,0h3£5£} we have ir m = 8.4%, 
for schedules m 6 {lhA£, lh5£} we have n m = 10.7%, 
and for schedules to G {2hAh, 2h5h, 3h4h, 3h5h} we have 
7r m = 11.2%. Figure [2] shows the corresponding optimal link 
throughputs as computed by the subgradient algorithm. 

The behavior of MMUO and MMUO with Proportional Fair 
are similar. In the interest of space, we present the plot for 
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Fig. 2. Link throughputs under the optimal solution. Top curves for 
users/links 0, 4 and 5; middle curves for users 2 and 3; bottom curve for 
user 1. (Note that some curves coincide.) 




Fig. 3. Link throughputs under MMUO in combination with PF. Top curve 
for user 0; second curve for users 4 and 5; third curve for users 2 and 3; 
bottom curve for user 1. 



MMUO with PF only (in Figures [3J. As we can see, the link 
throughputs closely approximate the optimal rates in Figure [2] 
In both plots, user (the closer user to the macro basestation) 
has the highest throughput while user 1 (the far user to the 
macro basestation) has the lowest throughput. Among the 
femto users, users 4 and 5 have higher throughputs since they 
create less interference to a macro user than users 2 and 3. 

We conclude with a brief discussion of how the algorithms 
for testing feasibility work in this context. In particular sup- 
pose that we are in configuration lh4£ and suppose that user 
3 wants to transmit at the high rate. This is infeasible. User 3 
may discover this by either a) overhearing the CQI reported 
by user 1 and realizing it is sufficiently close to the minimum 
acceptable CQI or b) briefly probing the channel at the high 
rate and then discovering that user can no longer support its 
current rate. In both cases user 3 decides not to transmit. 

VIII. Previous Work 

We now describe how our work relates to existing tech- 
niques. Prior work mainly falls into two categories, resource 
allocation in OFDM systems and CSMA-based algorithms for 
802.11 networks. As we have seen, our proposal has been to 
derive an algorithm based on CSMA techniques for the case 
of OFDM resource allocation. 

OFDM resource allocation: LTE uses an OFDM physical 
layer. Resource allocation in OFDM systems addresses prob- 
lems such as channel selection, local scheduling, power control 
and user association, i.e. which basestation serves which user. 
One popular technique, e.g. used in [6|, J4), (5), is a Gibbs 
sampler approach based on Interacting Particle Systems. The 
main idea here is that for a given network configuration each 



node has a local energy based on the interference that it both 
causes and receives. Nodes then pick new states based on 
their local energy. Gibbs sampler techniques have also been 
used to motivate greedy algorithms for LTE resource block 
selection, e.g. 0. Another popular technique, e.g. used in 
|fl9ll , ITS! , is to set power levels according to a gradient ascent 
approach. In particular each transmitter adjusts power levels 
so as to improve network utility in its neighborhood. Both 
the Gibbs sampler and the gradient ascent based methods 
require information exchange on how much interference each 
transmitter causes to each receiver. For the Gibbs sampler 
methods interference information needs to be exchanged in 
order to calculate local energy levels. For the gradient ascent 
methods nodes need to exchange "partial derivative" informa- 
tion to indicate how the interference they experience would 
be affected by a change in a neighbor's power levels. We 
remark that MMUO does not require such detailed information 
exchange. It bases its calculations on CQI messages that 
are already included in LTE, augmented with the activity 
indicators (and possibly safety margins). 

CSMA-based Algorithms: In the classic CSMA setup all 
links wish to access a single channel. Jiang and Walrand 
ifTUll showed that CSMA can achieve any set of feasible 
throughputs. Since this result, a number of papers have looked 
at how to make channel access rates dependent on local queue 
sizes in order to keep the system stable, e.g. 0, El, 0, [16|, 
IT71 . As already discussed, we have based our analysis on the 
work [14 1 (later extended in fl5l . Ifl3l ) that analyzed utility 
maximization in a CSMA setting. 

IX. Conclusion 

In this paper we have presented a CSMA-based scheduling 
algorithm for heterogenous LTE networks with both macro and 
small cells. Our main contribution is twofold. Mathematically, 
our algorithm handles the general multiple transmission rates 
on multiple carriers and achieves utility optimality. For the 
practical setting, the communication among the basestations 
utilizes the existing CQI-based technology and hence the 
additional signaling is minimal. 
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